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Abstract 

The absent-minded driver's problem illustrates that probabilistic strategies can 
give higher pay-offs than deterministic ones. We show that there are strategies 
using quantum entangled states that give even higher pay-offs, both for the original 
problem and for the generalized version with an arbitrary number of intersections 
and any possible set of pay-offs. 
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1 Introduction 

1.1 The absent-minded driver's problem 

The so-called paradox of the absent-minded driver was introduced by Piccione 
and Rubinstein in [1] and further discussed in [2,3] and references therein: an 
individual is sitting late at night in a bar planning his midnight trip home. 
The trip starts at the bar, the start in Fig. 1. There is a highway with 
two consecutive exits (or intersections), X and Y, and he has to take the 
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second, Y, to get home (pay-off 4). If he takes the first one, he arrives at a 
bad neighborhood (pay-off 0), and if he fails to take either, he has to stay in 
a motel at the end of the highway (pay-off 1). He cannot go back. There are 
two essential assumptions: 

(I) Indistinguishability: The intersections X and Y are indistinguishable by any 
experiment performed at one intersection. When the driver is at one inter- 
section, no experiment can give him information about which intersection 
he is at. 

(II) Absent-mindedness: The driver is absent-minded and is aware of this fact. 
Absent-mindedness only affects his memories about whether he has already 
gone through one of the intersections; at X he knows he might be at Y but 
has forgotten passing X, and at Y he cannot remember passing X. Apart 
from this, the driver is perfectly able to remember anything else. 

Some remarks about these assumptions follow: 

(i) These assumptions are not independent: the indistinguishability of the inter- 
sections is only relevant for an absent-minded driver, and absent-mindedness 
is only relevant when the intersections are indistinguishable (otherwise, the 
driver could obtain information for decision-making in spite of his absent- 
mindedness). 

(ii) Since the driver's absent-mindedness is limited to his memories about the 
intersections, but does not prevent him from possessing information about 
the rest of the universe, then it is forbidden any experiment at one inter- 
section whose result, together with any information about the rest of the 
universe, allows the driver to obtain information about which intersection 
he is at. 

(iii) Implicit in the rules is the fact that the driver cannot transmit informa- 
tion from one intersection to the rest of the universe (and, in particular, 
to the other intersection), because this could be used to distinguish the 
intersections. 

The above scenario allows Piccione and Rubinstein to exhibit a conflict be- 
tween two ways of reasoning at an intersection: 

"Planning his trip at the bar, the decision maker must conclude that it is 
impossible for him to get home and that he should not exit when reaching an 
intersection. Thus, his optimal plan will lead him to spend the night at the 
motel and yield a payoff of 1. Now suppose that he reaches an intersection. 
If he had decided to exit, he would have concluded that he is at the first 
intersection. Having chosen the strategy to continue, he concludes that he 
is at the first intersection with probability 1/2. Then, reviewing his plan, 
he finds that it is optimal for him to leave the highway since it yields an 
expected payoff of 2. Despite no new information and no change in his 
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Fig. 1. The absent-minded driver's problem. 

preferences, the decision maker would like to change his initial plan once he 
reaches an intersection! [1]" 

Piccione and Rubinstein make use of this apparent paradox to illustrate the 
advantages of probabilistic (or random [1], or mixed) strategies versus deter- 
ministic (or pure [1]) strategies. At the intersections, the driver can either 
continue along or exit the highway. Accordingly, there are are two possible 
deterministic strategies: either to always continue (pay-off 1) or to always 
exit (pay-off 0). Alternatively, at the intersections, the driver can toss a (suit- 
able weighted) coin with a probability p for heads (which means continue) 
and a probability 1 — p for tails (which means exit) . The expected pay-off of 
this probabilistic strategy is 4p (1 — p) + p 2 . Therefore, if p > 1/3, this strat- 
egy gives a higher pay-off than the best deterministic strategy. The optimal 
probabilistic strategy consists of choosing p = 2/3 (pay-off 4/3). 

1.2 Quantum strategies 

Game theory has recently found a new direction based on the possibility of 
the resources of quantum mechanics becoming available to the players [4]. 
This introduces new possibilities which lead to advantages over their clas- 
sical counterparts (however, see [5]). So far, the advantages of the so-called 
quantum strategies have been limited to situations involving players with per- 
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feet recall [6,7,8,9,10,11,12]. As we shall illustrate, decision problems involving 
players with imperfect recall are a promising arena for the application of quan- 
tum resources. In particular, we will see that quantum mechanics allows the 
absent-minded driver to make use of correlations without compromising the 
assumptions (or rules) of the problem and thereby obtain higher pay-offs. 



2 Physical realization of probabilistic strategies 

Let us start by re-examining from the perspective of physics what Piccione 
and Rubinstein consider the optimal solution to the absent-minded driver's 
problem: a probabilistic strategy. Can we implement a probabilistic strategy 
in a real experiment without compromising the assumptions of the problem 
at a fundamental level? In particular, how can we accomplish the driver's 
action of "tossing a coin" in the intersection satisfying the requirement of 
even-in-principle indistinguishability of the intersections? Strictly speaking, 
the driver is not allowed to carry a coin since he could use it to bypass his 
absentmindedness and keep track of the intersections he passes. However, the 
driver may instead place a coin in each of the intersections before the trip 
starts. 

The first problem is which physical system can be used as a coin. Strictly 
speaking, real coins are highly complex systems and therefore, in principle, 
it is not difficult to distinguish between two of them. This problem can be 
eluded by replacing real coins by identical physical systems (i.e., those with 
the same composition of identical elementary particles and prepared in the 
same physical state). 

The next problem is the mechanism for "tossing" the coin. By "tossing a coin" 
we mean that the driver can use his coin to perform a two-output experiment 
with probability p for one of the outputs. However, this is a very difficult task 
from the perspective of classical physics, where the results of experiments are 
essentially predefined in the state of the coin and the device used to toss it. In 
this scenario both are under the driver's control, and therefore the result of the 
experiment can in principle be fixed by the driver beforehand. In quantum me- 
chanics, however, it can be proven, under some general assumptions, that the 
results of certain experiments are not predefined [13,14], but are created [15] 
when the experiment is performed. Therefore, at least from the perspective of 
quantum mechanics, there is a method for tossing a coin without compromis- 
ing the assumptions of the absent-minded driver's problem at a fundamental 
level. The driver can place a "quantum coin" [6] or qubit (i.e., a two- level 
quantum system), say a spin-| particle, in each of the intersections (X and 
Y). Let us denote the quantum state of the two quantum coins as pxy- 
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The indistinguishability assumption implies that the quantum state of each 
quantum coin is represented by the same reduced density matrix, tixPxY — 
trypxy- Moreover, states with unequal reduced density matrices hidden in 
classical mixtures (i.e., states of the form p X y = X^PilV'xyXV'xy I' wnere > in 
some instances, i = a, the quantum coins are in different states, trxipxY 
tiYipxy) are forbidden, since there is no way to ensure that the driver does 
not have a priori knowledge on the particular instance that will appear in 
each intersection in a particular trip. This knowledge would allow the driver 
to learn, with a given probability, which intersection he is at. Hence this does 
not fulfill the assumption of indistinguishability. 

A probabilistic strategy can be implemented right away by a suitable mea- 
surement on a suitable quantum state. For instance, by preparing the qubits 
in the pure state 

1$) = (1) 
where 

i0) = v /pio> + v /r~Hi>, ( 2 ) 

and measuring at the intersection the observable 

a 2 = |0)(0|-|l)(l|. (3) 

When the driver arrives to any intersection, he measures a z , and the prob- 
ability of obtaining continue is p, and the probability of obtaining exit 
is 1 — p. 



3 Entanglement-assisted strategies 



At this point, one can think about the possibility of using more complex quan- 
tum mechanical resources instead of just experiments on quantum systems as 
generators of random numbers. Indeed, here we show that there are quantum 
strategies which give higher expected pay-offs than any probabilistic one. For 
instance, let us prepare two qubits in the singlet state 

|*> = ^(|01>-|10», (4) 



and put one qubit in each intersection. The reduced states of each qubit are 
both maximally mixed and cannot be described in terms of classical correla- 
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tions [13]. When the driver arrives at the first intersection, he measures cr z , 
and the probability of obtaining either continue or exit is 1/2. However, 
by using the singlet state, the driver has managed to introduce correlations 
between both intersections without violating the assumptions of the absent- 
minded driver's scenario: whenever he obtains continue in the first intersec- 
tion, he obtains exit in the second, and vice-versa. Therefore, the expected 
pay-off is 2, which is higher than the highest expected pay-off (4/3) for a prob- 
abilistic strategy. Note, that as mentioned earlier, the use of pseudo-random 
(i.e., deterministic) procedures to generate ant i- correlations would immedi- 
ately lead to distinguishable intersections: in principle the driver could be in 
possession of the "seed" and would be able to recognize the intersections by 
the different outcomes of their pseudo-random number generators. 

Moreover, the previous strategy achieves not only a higher pay-off than any 
probabilistic strategy, but actually the optimal pay-off for the absent-minded 
driver's problem. This can be proven as follows. The driver has complete 
knowledge of the state of the universe before the game starts, hence we can as- 
sume that the initial state is pure \<&)xyr, where R refers to the Hilbert space 
of the rest of the universe. During the game he will have restricted access to 
that state via sequential local operations on systems X and Y . Moreover, the 
driver lacks any memory or reference that tells him which intersection he is at 
and therefore cannot choose a strategy accordingly. With all the above consid- 
erations we can now assume without loss of generality that an optimal strategy 
for the absent-minded driver will consist on preparing the quantum systems 
in XY in a pure state and, on his arrival at an intersection, performing 
a measurement in a fixed basis, let us say the computational basis (|0), |1)}. 
Indeed, such a scheme will always be able to provide the same correlations be- 
tween the measurement outcomes than the most general scheme, and therefore 
provide the optimal pay-off. We now have to maximize over the possible states 
= Y^l j=o a ij\ij)- Since the coherences do not affect the pay-off, it suffices 
to look at the probability distribution {|ajj| 2 } of distinct classical instruction 
sets {(i, j)} and check for full compliance of the rules when the phases are 
picked. Any instruction that does not allow the driver to distinguish between 
the intersections can be expressed as a convex linear combination of three ba- 
sic strategies. The first is the instruction to continue in both intersections 
(pay-off Pi = 1), represented by (0, 0); the second is to exit in both intersec- 
tions (pay-off P 2 = 0), represented by (1, 1); and the third gives continue 
in X and exit in Y for half of the runs, and exit in X and continue in Y 
for the other half (pay-off P 3 = 2), and can be represented by (0, 1) + (1,0). 
The optimal strategy should maximize P = J2^=iPjPji where the probabilities 
are positive, pj > 0, and normalized, J2^=\Pi — 1- Since the expected pay-off 
itself is a convex function, it is clear that its maximum value will be achieved 
with the extreme strategy of highest pay-off; that is, the optimal pay-off is 
P = P 3 = 2 and can be reached with the quantum state of Eq. (4). 
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Fig. 2. The absent-minded driver's problem with three intersections. 



A natural question is whether or not the same maximum pay-off could be 
achieved with less quantum resources. Can we mimic the strategy using the 
state (4) by means of a single qubit (as a source of genuine randomness) cou- 
pled to a classical mechanism to generate the desired correlations? The answer 
is no. On one hand, obtaining the maximum pay-off does not require perfect 
correlations but perfect anti- correlations between the results on both inter- 
sections, and any classical mechanism capable of generating different results 
would introduce a distinction between the intersections. On the other hand, 
such a classical mechanism could be used to transmit information between 
the intersections when the driver is inside one of them and thus violates (iii). 
Therefore we conclude that the strategy using two qubits in the state (4) is 
the simplest giving the maximum pay-off. At this point we also note that even 
if one could justify the use of pseudo-random number generators (or any other 
classical source of randomness), justifying the use of two generators that cre- 
ate correlated random outputs is much more demanding, as it immediately 
implies the existence of a reproducible and hence predictable mechanism of 
generating the outputs. 
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4 The absent-minded driver's problem with more intersections 



4-1 An example with three intersections 

Let us now consider the absent-minded driver's problem with N > 2 intersec- 
tions. Here we can restrict ourselves to strategies where, at each intersection, 
the driver measures in the computational basis the prepared iV-party pure 
state = J2vC*v\v), where {vl}f is the set of d = 2 N iV-dimensional binary 
arrays, and corresponds to the set of instructions {(a, b, e)}. Here, finding 
the optimal strategy also requires finding the vertices of the convex poly- 
tope that results from imposing indistinguishability conditions on the simplex 
probability space {p 1 = \a vi \ 2 , . . . ,Pd = |o^,J 2 }. The condition for the indis- 
tinguishability of the intersections is that the reduced density matrices in each 
intersection have to be identical p\ — . . . — pjy. In the computational basis, 
the diagonal matrix elements are equal if and only if all components of the 
vector u = J2 m PmV m are identical, i.e., 

u(i) = u(j) Vi,j = l,...,2 N . (5) 

For the moment we will disregard the conditions that arise from equating the 
out-of-diagonal terms of the reduced density matrices. 

Any legitimate solution can be expressed as a convex combination of the so- 
lutions associated to each vertex of the polytope. Each vertex E is specified 
by a vector p E in the probability space. As before, the optimal pay-off can be 
reached at one of the vertices, and the corresponding mixed set of instructions 
can be mapped into a quantum superposition by fixing the arbitrary phases. 
The extremal property of the vertices implies that all arrays contributing to a 
specific vertex, i.e., {v m : p^ 7^ 0} have to be linearly independent. Using this 
fact and the indistinguishability condition, it can be easily seen that two ar- 
rays contributing to a vertex cannot differ solely in a single component. That 
is, for any (N — l)-dimensional binary array x, (1, x) and (0, x) cannot appear 
together in the convex combination that defines a vertex. From this it follows 
that all the out-of-diagonal matrix elements of the reduced density operator 
Pi = t^2,...,NP E have to vanish, 

(l| Pl |0) = E(l^|0a ? ) = 0, (6) 

M 

where p E = \4> E )((fi E \ with \<p E ) = J2 m \Jv~^ Q% ^ m \vm) ■ Therefore, out-of-diagonal 
terms will not impose any additional constraints, and the optimal solution 
to the absent-minded driver's problem can always be chosen to be \(p E ) = 
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E m yPm el ^ m Wm) for a given vertex p E . A remarkable feature is that all the 
vertices except two, (0, . . . , 0) and (1, . . . , 1) — which represent the two possible 
deterministic strategies — , are implemented by means of entangled states. 

The particular solution for a given pay-off assignment can be obtained in a 
straightforward manner by standard linear programming methods. Obtaining 
the set of possible optimal solutions can be significantly more difficult. For 
this purpose one can use a method developed in the context of the theory 
of iV-party games. To every binary array v, one can associate a subset of 
P = {1, ...,JV}, S v — {i : v(i) = 1}. The condition of indistinguishability 
on the arrays {v} is equivalent, in game-theoretical terms, to saying that the 
collection of sets, or coalition of players, {S v } is balanced. The concept of a min- 
imally balanced collection [16,17], which corresponds to the set of contributing 
arrays in a vertex of the polytope, has found important applications for the 
theory of iV-party games. This theory does not apply to the absent-minded 
driver's problem; however, we can use an inductive method developed in that 
context [18] to generate the possible solutions (vertices) of the iV-intersection 
problem from those of the N — 1 problem. 

Let us consider the absent-minded driver's problem with three intersections [2] 
and the set of pay-offs given in Fig. 2. In this case, the vertices of the polytope 
of possible solutions are B l = (0, 0, 0), B 2 = (1, 1, 1), B 3 = (0, 0, 1) + (1, 1, 0), 
B 4 = (0, 1, 0) + (1, 0, 1), B 5 = (1, 0, 0) + (0, 1, 1), B 6 = (0, 0, 1) + (0, 1, 0) + 
(1,0,0), and B 7 = (0,1,1) + (1,0,1) + (1,1,0). It can be easily seen that, 
for any < n, there is a probabilistic strategy that gives a higher expected 
pay-off than any deterministic one (see Fig. 3). More interestingly, there is a 
quantum entanglements-assisted strategy that gives a higher pay-off than any 
probabilistic or deterministic strategy (Fig. 3). For < n < 2, the optimum 
strategy (pay-off 2) can be implemented by preparing three qubits in the so- 
called Greenberger-Horne-Zeilinger [14] state 

|GHZ> = ^(|001> + |110», (7) 



putting one qubit in each intersection and measuring the observable a z defined 
in Eq. (3). For 2 < n < 8, the optimum strategy (pay-off (4 + n)/3) can be 
implemented by preparing three qubits in the so-called W [19,20] state 

|W) = -^ (|001) + |010) + |100)), (8) 



followed by the a z measurement. Finally, for n > 8, the optimum pay-off 
(pay-off n/2) can be reached in a similar manner with GHZ-type states, for 
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Fig. 3. Optimal expected pay-offs as a function of n for deterministic (fine line, bot- 
tom), probabilistic (dashed line, in the middle), and quantum entanglement-assisted 
(thick line, top) strategies for the absent-minded driver's problem with three inter- 
sections of Fig. 2. 

instance, 

|GHZ') = i= (|011) + |100)). (9) 

At the moment we do not have a characterization of the full set of optimal 
solutions for the iV-intersection problem. From the previous reasoning we know 
that a necessary and sufficient condition for a solution to be optimal is that 
the contributing arrays are linearly independent. This also means that the 
superpositions will involve at most iV + 1 terms. We also know that, given 
a vertex (i.e., an optimal solution), if we exchange 0's for l's and vice versa, 
then we have another vertex. From the previous examples it might seem that 
optimal solutions always involve equally weighted superpositions. However, 
for N > 3 one can find optimal solutions for which this is not the case. 



5 Conclusions 

To sum up, the absent-minded driver's problem is that he does not know which 
intersection he is at and cannot have instructions indicating the route home. 
However, the assumptions under this scenario do not keep him from using 
certain quantum superpositions of instructions. We have seen that the use of 
entangled states allows him to increase the expected pay-off beyond what was 
previously considered the maximum pay-off. 

The key point to understand the quantum advantage is that the limited absent- 
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mindedness of the driver requires even-in-principle indistinguishability of the 
intersections; otherwise one cannot exclude the possibility that the driver pos- 
sesses some information of the state of the rest of the universe (i.e., by pre- 
arranging it before his trip) such that during his trip he can figure out what 
intersection he is at. Hence, classically correlated physical systems (and spe- 
cially anti-correlated systems) that could provide optimal expected pay-offs, 
are not allowed for decision-making. However, quantum states such as (4), (7), 
(8), (9), or those needed to implement all non-trivial vertices guarantee indis- 
tinguishability by fundamental laws of physics, while providing at the same 
time the desired correlations. According to quantum mechanics the quantum 
state of the A-qubit system is the most complete description of the system, in 
the sense that it provides the maximum possible information about any future 
experiments on these qubits. The proposed entangled states, hence guarantee 
that the absent-minded driver cannot determine at what intersection he is 
during his trip. In addition, these states hide the "correct" instruction sets 
in superpositions with non-optimal instructions, and thus provide high pay- 
off values, without transgressing the assumptions behind the absent-minded 
driver's scenario. 

The above examples suggest that strategies based on quantum entanglement 
can also provide advantages over classical strategies in other decision problems 
involving memory limitations or imperfect recall [1,2,3] in which the indistin- 
guishability of the alternatives plays an essential role. 
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